Lexical Diversity in Statistical and Neural Machine Translation
نویسندگان
چکیده
Neural machine translation systems have revolutionized processes in terms of quantity and speed recent years, they even been claimed to achieve human parity. However, the quality their output has also raised serious doubts concerns, such as loss lexical variation, evidence “machine translationese”, its effect on post-editing, which results “post-editese”. In this study, we analyze outputs three English Slovenian diversity different genres. Using both quantitative qualitative methods, one statistical two neural systems, compare them a reference translation. Our analyses based metrics show diverging results; however, particularly ones, mostly exhibit larger than counterparts. Nevertheless, method shows that these are not always reliable tool assess true lot “creativity”, especially by is often unreliable, inconsistent, misguided.
منابع مشابه
Lexical Micro-adaptation in Statistical Machine Translation
We introduce a generic framework in Statistical Machine Translation (SMT) in which lexical hypotheses, in the form of a target language model local to the input sentence, are used to guide the search for the best translation, thus performing a lexical microadaptation. An instantiation of this framework is presented and evaluated on three language pairs, where these auxiliary hypotheses are deri...
متن کاملImproving Lexical Choice in Neural Machine Translation
We explore two solutions to the problem of mistranslating rare words in neural machine translation. First, we argue that the standard output layer, which computes the inner product of a vector representing the context with all possible output word embeddings, rewards frequent words disproportionately, and we propose to fix the norms of both vectors to a constant value. Second, we integrate a si...
متن کاملLexical Features for Statistical Machine Translation
Title of dissertation: LEXICAL FEATURES FOR STATISTICAL MACHINE TRANSLATION Jacob Devlin, Master of Science, 2009 Dissertation directed by: Professor Bonnie Dorr Department of Computer Science In modern phrasal and hierarchical statistical machine translation systems, two major features model translation: rule translation probabilities and lexical smoothing scores. The rule translation probabil...
متن کاملLexical Syntax for Statistical Machine Translation
Statistical Machine Translation (SMT) is by far the most dominant paradigm of Machine Translation. This can be justified by many reasons, such as accuracy, scalability, computational efficiency and fast adaptation to new languages and domains. However, current approaches of Phrase-based SMT lacks the capabilities of producing more grammatical translations and handling long-range reordering whil...
متن کاملNeural Machine Translation Advised by Statistical Machine Translation
Neural Machine Translation (NMT) is a new approach to machine translation that has made great progress in recent years. However, recent studies show that NMT generally produces fluent but inadequate translations (Tu et al. 2016; He et al. 2016). This is in contrast to conventional Statistical Machine Translation (SMT), which usually yields adequate but non-fluent translations. It is natural, th...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Information
سال: 2022
ISSN: ['2078-2489']
DOI: https://doi.org/10.3390/info13020093